# Qwen3 Reranker 4B W4A16 G128
- **License:** Apache-2.0
- **Task:** Large Language Model
- **Framework:** Transformers
- **Author:** boboliu

Low VRAM consumption: the result of GPTQ quantization (W4A16, group size 128) applied to Qwen/Qwen3-Reranker-4B, significantly reducing VRAM usage.
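The "W4A16 G128" label above means 4-bit weights, 16-bit activations, and a separate quantization scale for every group of 128 weights. A minimal sketch of that storage scheme is below; the helper names are hypothetical, and this uses plain round-to-nearest per group rather than GPTQ's error-compensating rounding, so it illustrates the format, not the full algorithm.

```python
import numpy as np

GROUP = 128  # "G128": one scale per 128 consecutive weights


def quantize_w4_g128(w):
    """Quantize a flat fp32 weight vector to int4 values with per-group fp16 scales."""
    groups = w.reshape(-1, GROUP)                           # one row per group
    scales = np.abs(groups).max(axis=1, keepdims=True) / 7.0  # map max |w| to int4 limit 7
    q = np.clip(np.round(groups / scales), -8, 7).astype(np.int8)  # int4 range: -8..7
    return q, scales.astype(np.float16)


def dequantize(q, scales):
    """Reconstruct fp32 weights; at inference these multiply 16-bit activations ("A16")."""
    return (q.astype(np.float32) * scales.astype(np.float32)).reshape(-1)


rng = np.random.default_rng(0)
w = rng.standard_normal(512).astype(np.float32)

q, s = quantize_w4_g128(w)
w_hat = dequantize(q, s)
err = np.abs(w - w_hat).max()  # worst-case error is about half of one group scale
```

Storing `q` (4 bits per weight) plus one fp16 scale per 128 weights is what cuts VRAM to roughly a quarter of an fp16 checkpoint.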
# Flux 4bit
- **License:** Other
- **Task:** Text-to-Image
- **Author:** eramth

A Flux text-to-image model with the transformer and T5 text encoder quantized to 4-bit; supports non-commercial use only.
# Guanaco 7b Leh V2
- **License:** GPL-3.0
- **Task:** Large Language Model
- **Framework:** Transformers
- **Languages:** English, Chinese, Japanese
- **Author:** KBlueLeaf

A multilingual instruction-following language model based on LLaMA 7B, suitable for chatbots and instruction-following tasks.
# Featured Recommended AI Models